Amazon Web Services (AWS) has announced the integration of S3 Object Lambda with Amazon Athena. This integration allows users to modify data stored in Amazon S3 buckets while it is being queried by Amazon Athena. S3 Object Lambda enables users to add custom code to S3 GET, HEAD, and LIST API requests, allowing for the transformation of data as it is returned to applications. A practical application of this feature is the ability to mask sensitive data columns during queries in Amazon Athena.
To utilize this feature, users must first create an S3 Object Lambda Access Point for their S3 bucket. When creating an Amazon Athena table, users can then specify the alias of this access point as the source location instead of a direct S3 bucket reference. This setup ensures that the Lambda function is automatically invoked during an Athena query.
The integration of S3 Object Lambda with Amazon Athena is available in all AWS Regions, with the exception of the AWS GovCloud (US) Regions. Users can access and manage S3 Object Lambda through various interfaces, including the AWS Management Console, AWS Command Line Interface (CLI), Application Programming Interface (API), and AWS Software Development Kit (SDK) client.
For those interested in the cost implications of using S3 Object Lambda, pricing details can be found on the Amazon S3 pricing page. Additional information about S3 Object Lambda, including its features and how to get started, is available on the product detail page and in the S3 user guide.