AWS re:Invent 2024 Digest

Key AI/ML/Data/Analytics announcement digest

Xin Cheng
5 min readDec 26, 2024

Gen AI

Amazon Nova, 2

  • Amazon Nova Micro is a text-only model that delivers the lowest latency responses at very low cost.
  • Amazon Nova Lite is a very low cost multimodal model that is lightning fast for processing image, video, and text inputs.
  • Amazon Nova Pro is a highly capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks.

Amazon Bedrock Marketplace: Access over 100 foundation models in one place

Reduce costs and latency with Amazon Bedrock Intelligent Prompt Routing and prompt caching (preview)
Route requests and cache frequently used context in prompts to reduce latency and balance performance with cost efficiency.

Amazon Bedrock Guardrails now supports multimodal toxicity detection with image support (preview)
Build responsible AI applications — Safeguard them against harmful text and image content with configurable filters and thresholds.

Prevent factual errors from LLM hallucinations with mathematically sound Automated Reasoning checks (preview)
Enhance conversational AI accuracy with Automated Reasoning checks — first and only gen AI safeguard that helps reduce hallucinations by encoding domain rules into verifiable policies.

Introducing multi-agent collaboration capability for Amazon Bedrock (preview)
With multi-agent collaboration on Amazon Bedrock, developers can build, deploy, and manage multiple specialized agents working together seamlessly to tackle more intricate, multi-step workflows.

Knowledge Bases for Amazon Bedrock now supports Amazon Aurora PostgreSQL and Cohere embedding models

Accelerate your generative AI application development with Amazon Bedrock Knowledge Bases Quick Create and Amazon Aurora Serverless

New Amazon Bedrock capabilities enhance data processing and retrieval
Amazon Bedrock enhances generative AI data analysis with multimodal processing, graph modeling, and structured querying, accelerating AI application development.

Build faster, more cost-efficient, highly accurate models with Amazon Bedrock Model Distillation (preview)
Automates the process of creating a distilled model for your specific use case by generating responses from a large foundation model and fine-tunes a smaller FM with the generated responses.

Amazon Q Business is adding new workflow automation capability and 50+ action integrations
Amazon Q Business extends productivity with generative AI-powered workflow automation capability and 50+ actions for enterprise efficiency, enabling seamless task execution across tools like ServiceNow, PagerDuty, and Asana.

Machine Learning

Simplify analytics and AI/ML with new Amazon SageMaker Lakehouse.

Unifying data silos, Amazon SageMaker Lakehouse seamlessly integrates S3 data lakes and Redshift warehouses, enabling unified analytics and AI/ML on a single data copy through open Apache Iceberg APIs and fine-grained access controls.

Meet your training timelines and budgets with new Amazon SageMaker HyperPod flexible training plans
Unlock efficient large model training with SageMaker HyperPod flexible training plans — find optimal compute resources and complete training within timelines and budgets.

Accelerate foundation model training and fine-tuning with new Amazon SageMaker HyperPod recipes
Get started with training and fine-tuning popular publicly available foundation models, like Llama 3.1 405B, in just minutes with state-of-the-art performance.

Maximize accelerator utilization for model development with new Amazon SageMaker HyperPod task governance
Enable priority-based resource allocation, fair-share utilization, and automated task preemption for optimal compute utilization across teams.

Analytics

Solve complex problems with new scenario analysis capability in Amazon Q in QuickSight
Find solutions to your most critical business challenges with ease. Amazon Q in QuickSight enables business users to perform complex scenario analysis up to 10x faster than spreadsheets.

Developer tools

New Amazon Q Developer agent capabilities include generating documentation, code reviews, and unit tests

Enhancing coding productivity, Amazon Q Developer agents now offer capabilities for auto-generating documentation, conducting code reviews, and creating unit tests within IDEs and GitLab.

Compute

Amazon EC2 Trn2 Instances and Trn2 UltraServers for AI/ML training and inference are now available

With 4x faster speed, 4x more memory bandwidth, 3x higher memory capacity than predecessors, and 30% higher floating-point operations, these instances deliver unprecedented compute power for ML training and gen AI. AWS Tranium2-powered EC2, Amazon EC2 Trn2 UltraServers

Amazon Elastic VMware Service: review of Amazon Elastic VMware Service (Amazon EVS), a new, native AWS service for customers to run VMware Cloud Foundation (VCF) within their Amazon Virtual Private Cloud (Amazon VPC).

Networking

Securely share AWS resources across VPC and account boundaries with PrivateLink, VPC Lattice, EventBridge, and Step Functions
Orchestrate hybrid workflows accessing private HTTPS endpoints — no more Lambda/SQS workarounds. EventBridge and Step Functions natively support private resources, simplifying cloud modernization.

Storage

New physical AWS Data Transfer Terminals let you upload to the cloud faster
Rapidly upload large datasets to AWS at blazing speeds with the new AWS Data Transfer Terminal, secure physical locations offering high throughput connection.

Announcing Amazon FSx Intelligent-Tiering, a new storage class for FSx for OpenZFS
Delivering NAS capabilities with automatic data tiering among frequently accessed, infrequent, and archival storage tiers, Amazon FSx Intelligent-Tiering offers high performance up to 400K IOPS, 20 GB/s throughput, seamless integration with AWS services.

New Amazon S3 Tables: Storage optimized for analytics workloads
Amazon S3 Tables optimize tabular data storage (like transactions and sensor readings) in Apache Iceberg, enabling high-performance, low-cost queries using Athena, EMR, and Spark.

Database

Amazon Aurora DSQL, the fastest serverless distributed SQL database for always available applications. It offers virtually unlimited scale, highest availability, and zero infrastructure management.

Amazon DynamoDB global tables previews multi-Region strong consistency

AWS Database Migration Service now automates time-intensive schema conversion tasks using generative AI

AWS DMS Schema Conversion converts up to 90% of your schema to accelerate your database migrations and reduce manual effort with the power of generative AI.

Security

New AWS Security Incident Response helps organizations respond to and recover from security events

AWS introduces a new service to streamline security event response, providing automated triage, coordinated communication, and expert guidance to recover from cybersecurity threats.

Containers

Streamline Kubernetes cluster management with new Amazon EKS Auto Mode
With EKS Auto Mode, AWS simplifies Kubernetes cluster management, automating compute, storage, and networking, enabling higher agility and performance while reducing operational overhead.

Use your on-premises infrastructure in Amazon EKS clusters with Amazon EKS Hybrid Nodes
Unify Kubernetes management across your cloud and on-premises environments with Amazon EKS Hybrid Nodes — use existing hardware while offloading control plane responsibilities to EKS for consistent operations.

Keynote

--

--

Xin Cheng
Xin Cheng

Written by Xin Cheng

Multi/Hybrid-cloud, Kubernetes, cloud-native, big data, machine learning, IoT developer/architect, 3x Azure-certified, 3x AWS-certified, 2x GCP-certified

No responses yet