
AWSPU Ep12: Building Future-Proof Data Lakes with Apache Iceberg on AWS
About this listen
In this episode, we discuss the growing adoption of Apache Iceberg as a modern data lake format,
exploring how it's becoming the preferred choice for customers migrating from on-premises to
cloud-native architectures. The conversation covers AWS's deep integrations with Iceberg through
services like Amazon S3, AWS Glue Data Catalog, and SageMaker Lakehouse, as well as how key
partners like Starburst and Snowflake are enhancing the ecosystem with powerful connectors that
enable federated querying, flexible catalog management, and seamless data interoperability. We
highlight how organizations can easily migrate existing Parquet and ORC data lakes to Iceberg
without rewriting underlying data files, and emphasize that this shift represents a move toward
future-proof data architectures focused on ease of use, consistent governance, and reduced
development time rather than just cost optimization.
Contact us:
- aws-partner-unplugged-podcast-hosts@amazon.com
- Antony: https://www.linkedin.com/in/antony-prasad-thevaraj-04398518/
Links:
- https://aws.amazon.com/blogs/storage/build-a-managed-apache-iceberg-data-lake-using-starburst-and-amazon-s3-tables/
- https://aws.amazon.com/blogs/big-data/use-apache-iceberg-in-your-data-lake-with-amazon-s3-aws-glue-and-snowflake/