Skip to main content

AI Tech & Automation Blog

Build a drug discovery research assistant using Strands Agents and Amazon Bedrock
Jul 29, 2025
9 min

Build a drug discovery research assistant using Strands Agents and Amazon Bedrock

Drug discovery is a complex, time-intensive process that requires researchers to navigate vast amounts of scientific literature, clinical trial data, and molecu
Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs
Jul 29, 2025
3 min

Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs

Organizations need ML compute resources that can accommodate bursty peaks and periodic troughs. That means the consumption models for AI infrastructure need to
Build an intelligent eDiscovery solution using Amazon Bedrock Agents
Jul 26, 2025
11 min

Build an intelligent eDiscovery solution using Amazon Bedrock Agents

Legal teams spend bulk of their time manually reviewing documents during eDiscovery. This process involves analyzing electronically stored information across em
Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI
Jul 26, 2025
7 min

Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI

Developers building with gen AI are increasingly drawn to open models for their power and flexibility. But customizing and deploying them can be a huge challeng
Boost cold-start recommendations with vLLM on AWS Trainium
Jul 25, 2025
11 min

Boost cold-start recommendations with vLLM on AWS Trainium

Cold start in recommendation systems goes beyond just new user or new item problems—it’s the complete absence of personalized signals at launch. When someone fi
New Cluster Director features: Simplified GUI, managed Slurm, advanced observability
Jul 25, 2025
4 min

New Cluster Director features: Simplified GUI, managed Slurm, advanced observability

In April, we released Cluster Director, a unified management plane that makes deploying and managing large-scale AI infrastructure simpler and more intuitive th
Customize Amazon Nova in Amazon SageMaker AI using Direct Preference Optimization
Jul 24, 2025
19 min

Customize Amazon Nova in Amazon SageMaker AI using Direct Preference Optimization

At the AWS Summit in New York City , we introduced a comprehensive suite of model customization capabilities for Amazon Nova foundation models. Available as rea
25+ top gen AI how-to guides for enterprise
Jul 23, 2025
6 min

25+ top gen AI how-to guides for enterprise

The best way to learn AI is by building. From finding quick ways to deploy open models to building complex, multi-agentic systems, it’s easy to feel overwhelmed
Beyond accelerators: Lessons from building foundation models on AWS with Japan’s GENIAC program
Jul 23, 2025
12 min

Beyond accelerators: Lessons from building foundation models on AWS with Japan’s GENIAC program

In 2024, the Ministry of Economy, Trade and Industry (METI) launched the Generative AI Accelerator Challenge (GENIAC) —a Japanese national program to boost gene
Build an AI-powered automated summarization system with Amazon Bedrock and Amazon Transcribe using Terraform
Jul 22, 2025
23 min

Build an AI-powered automated summarization system with Amazon Bedrock and Amazon Transcribe using Terraform

Extracting meaningful insights from unstructured data presents significant challenges for many organizations. Meeting recordings, customer interactions, and int
The Gory Details of Finetuning SDXL and Wasting $16k
Jul 22, 2025
33 min

The Gory Details of Finetuning SDXL and Wasting $16k

Details on how the big diffusion model finetunes are trained is scarce, so just like with version 1 , and version 2 of my model bigASP, I'm sharing all the deta
Build real-time travel recommendations using AI agents on Amazon Bedrock
Jul 19, 2025
10 min

Build real-time travel recommendations using AI agents on Amazon Bedrock

Generative AI is transforming how businesses deliver personalized experiences across industries, including travel and hospitality. Travel agents are enhancing t
How to enable Secure Boot for your AI workloads
Jul 19, 2025
6 min

How to enable Secure Boot for your AI workloads

As organizations race to deploy powerful GPU-accelerated workloads, they might overlook a foundational step: ensuring the integrity of the system from the very
Cloud CISO Perspectives: Our Big Sleep agent makes a big leap, and other AI news
Jul 18, 2025
9 min

Cloud CISO Perspectives: Our Big Sleep agent makes a big leap, and other AI news

Welcome to the first Cloud CISO Perspectives for July 2025. Today, Sandra Joyce, vice president, Google Threat Intelligence, talks about an incredible milestone
Evaluating generative AI models with Amazon Nova LLM-as-a-Judge on Amazon SageMaker AI
Jul 18, 2025
20 min

Evaluating generative AI models with Amazon Nova LLM-as-a-Judge on Amazon SageMaker AI

Evaluating the performance of large language models (LLMs) goes beyond statistical metrics like perplexity or bilingual evaluation understudy (BLEU) scores. For
Accenture scales video analysis with Amazon Nova and Amazon Bedrock Agents
Jul 17, 2025
11 min

Accenture scales video analysis with Amazon Nova and Amazon Bedrock Agents

This post was written with Ilan Geller, Kamal Mannar, Debasmita Ghosh, and Nakul Aggarwal of Accenture. Video highlights offer a powerful way to boost audience
Build with more flexibility: New open models arrive in the Vertex AI Model Garden
Jul 17, 2025
4 min

Build with more flexibility: New open models arrive in the Vertex AI Model Garden

In our ongoing effort to provide businesses with the flexibility and choice needed to build innovative AI applications, we are expanding the catalog of open mod
Amazon Bedrock Knowledge Bases now supports Amazon OpenSearch Service Managed Cluster as vector store
Jul 16, 2025
32 min

Amazon Bedrock Knowledge Bases now supports Amazon OpenSearch Service Managed Cluster as vector store

Amazon Bedrock Knowledge Bases has extended its vector store options by enabling support for Amazon OpenSearch Service managed clusters, further strengthening i
Behind the Streams: Live at Netflix. Part 1
Jul 16, 2025
11 min

Behind the Streams: Live at Netflix. Part 1

Behind the Streams: Three Years Of Live at Netflix. Part 1. By Sergey Fedorov , Chris Pham , Flavio Ribeiro , Chris Newton , and Wei Wei Many great ideas at Net
How to enable real time semantic search and RAG applications with Dataflow ML
Jul 16, 2025
6 min

How to enable real time semantic search and RAG applications with Dataflow ML

Embeddings are a cornerstone of modern semantic search and Retrieval Augmented Generation (RAG) applications . In short, they enable applications to understand

Looking to Accelerate AI Projects?

Let's chat about your vision and projects!