AI Tech & Automation Blog

Scaling high-performance inference cost-effectively
Sep 11, 2025
8 min

At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere
TII Falcon-H1 models now available on Amazon Bedrock Marketplace and Amazon SageMaker JumpStart
Sep 11, 2025
13 min

This post was co-authored with Jingwei Zuo from TII. We are excited to announce the availability of the Technology Innovation Institute (TII)’s Falcon-H1 model
Introducing the Agentic SOC Workshops for security professionals
Sep 10, 2025
2 min

The security operations centers of the future will use agentic AI to enable intelligent automation of routine tasks, augment human decision-making, and streamli
Powering innovation at scale: How AWS is tackling AI infrastructure challenges
Sep 10, 2025
5 min

As generative AI continues to transform how enterprises operate—and develop net new innovations—the infrastructure demands for training and deploying AI models
Maximize HyperPod Cluster utilization with HyperPod task governance fine-grained quota allocation
Sep 9, 2025
17 min

We are excited to announce the general availability of fine-grained compute and memory quota allocation with HyperPod task governance. With this capability, cu
Registration now open: Our no-cost, generative AI training and certification program for veterans
Sep 9, 2025
3 min

Growing up in a Navy family instilled a strong sense of purpose in me. My father’s remarkable 42 years of naval service not only shaped my values, but inspired
Accelerating HPC and AI research in universities with Amazon SageMaker HyperPod
Sep 6, 2025
7 min

This post was written with Mohamed Hossam of Brightskies. Research universities engaged in large-scale AI and high-performance computing (HPC) often face signif
Investigate fast with AI: Gemini Cloud Assist for Dataproc & Serverless for Apache Spark
Sep 6, 2025
5 min

Apache Spark is a fundamental part of most modern lakehouse architectures, and Google Cloud's Dataproc provides a powerful, fully managed platform for running S
It’s the Humidity: How International Researchers in Poland, Deep Learning and NVIDIA GPUs Could Change the Forecast
Sep 6, 2025
2 min

For more than a century, meteorologists have chased storms with chalkboards, equations, and now, supercomputers. But for all the progress, they still stumble ov
Building a Solid Foundation: Best Practices for Local Redis Use with N8N
Sep 5, 2025
4 min

Upgrade local N8N with Redis to gain faster execution, safe concurrency, and resilient state. A clear, practical path from laptop setup to production‑ready patterns.
Build character consistent storyboards using Amazon Nova in Amazon Bedrock – Part 2
Sep 5, 2025
12 min

Although careful prompt crafting can yield good results, achieving professional-grade visual consistency often requires adapting the underlying model itself. Bu
How Baseten achieves 225% better cost-performance for AI inference (and you can too)
Sep 5, 2025
5 min

Baseten is one of a growing number of AI infrastructure providers, helping other startups run their models and experiments at speed and scale. Given the importa
Authenticate Amazon Q Business data accessors using a trusted token issuer
Sep 4, 2025
10 min

Since its general availability in 2024, Amazon Q Business (Amazon Q) has enabled independent software vendors (ISVs) to enhance their Software as a Service (Saa
From query to cart: Inside Target’s search bar overhaul with AlloyDB AI
Sep 4, 2025
7 min

Editor’s note: Target set out to modernize its digital search experience to better match guest expectations and support more intuitive discovery across millions
The Agent Development Kit Hackathon with Google Cloud: Announcing the winners and highlights
Sep 3, 2025
3 min

The Agent Development Kit (ADK) Hackathon has officially wrapped, with over 10,400 participants from 62 countries, resulting in 477 subm
Train and deploy models on Amazon SageMaker HyperPod using the new HyperPod CLI and SDK
Sep 3, 2025
31 min

Training and deploying large AI models requires advanced distributed computing capabilities, but managing these distributed systems shouldn’t be complex for dat
Detect Amazon Bedrock misconfigurations with Datadog Cloud Security
Aug 30, 2025
9 min

This post was co-written with Nick Frichette and Vijay George from Datadog. As organizations increasingly adopt Amazon Bedrock for generative AI applications, p
OAuth Setup: The Unfiltered Guide for Builders
Aug 29, 2025
3 min

Stop wasting time with confusing documentation. This is the direct, no-BS guide for builders to set up OAuth 2.0 for Google, GitHub, Apple, and more. Get your API keys and get back to work.
Meet Boti: The AI assistant transforming how the citizens of Buenos Aires access government information with Amazon Bedrock
Aug 29, 2025
14 min

This post is co-written with Julieta Rappan, Macarena Blasi, and María Candela Blanco from the Government of the City of Buenos Aires. The Government of the Cit
Mercury foundation models from Inception Labs are now available in Amazon Bedrock Marketplace and Amazon SageMaker JumpStart
Aug 28, 2025
18 min

Today, we are excited to announce that Mercury and Mercury Coder foundation models (FMs) from Inception Labs are available through Amazon Bedrock Marketplace an
