AWS Lambda

Real Time Streaming With Kafka and Flink - Lab 6 Consume Data From Kafka Using Lambda

December 14, 2023 | 6 min read | Data Streaming, Real Time Streaming With Kafka and Flink, Amazon MSK, Apache Kafka, AWS, AWS Lambda, Kpow, Python

Amazon MSK can be configured as an event source of a Lambda function. Lambda internally polls for new messages from the event source and then synchronously invokes the target Lambda function. With this feature, we can develop a Kafka consumer application in a serverless environment, where developers can focus on application logic. In this lab, we will discuss how to create a Kafka consumer using a Lambda function.
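As a rough illustration of what such a consumer can look like, the sketch below decodes the batch that Lambda passes to the function. It relies on the MSK event shape in which records are grouped under topic-partition keys and carry base64-encoded values. The assumption that each value is a JSON document, and the helper name `decode_records`, are illustrative and not taken from the lab.

```python
import base64
import json


def decode_records(event):
    """Flatten an MSK event into a list of decoded messages."""
    messages = []
    # Records are grouped under "<topic>-<partition>" keys.
    for batch in event.get("records", {}).values():
        for record in batch:
            # Message values arrive base64-encoded; JSON payloads are assumed here.
            value = base64.b64decode(record["value"]).decode("utf-8")
            messages.append(
                {
                    "topic": record["topic"],
                    "partition": record["partition"],
                    "offset": record["offset"],
                    "value": json.loads(value),
                }
            )
    return messages


def lambda_handler(event, context):
    messages = decode_records(event)
    for message in messages:
        # Application logic goes here; this sketch only logs each message.
        print(json.dumps(message))
    return {"processed": len(messages)}
```

Keeping the decoding in a separate function makes it easy to test locally with a hand-built event before deploying.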

October 26, 2023 | 14 min read | Data Streaming, Real Time Streaming With Kafka and Flink, Amazon EventBridge, Amazon MSK, Apache Kafka, AWS, AWS Lambda, Kpow, Python

In this lab, we will create a Kafka producer application using AWS Lambda, which sends fake taxi ride data into a Kafka topic on Amazon MSK. An Amazon EventBridge schedule rule invokes a configurable number of instances of the producer Lambda function. In this way, we can generate test data concurrently, scaled to the desired volume of messages.
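The producer side can be sketched along the following lines. This is a minimal sketch under stated assumptions, not the lab's code: the record fields (`id`, `passenger_count`, `trip_distance`, `created_at`) are hypothetical, and `send` is a stand-in callable for whatever Kafka client the function uses.

```python
import json
import random
import uuid
from datetime import datetime, timezone


def make_ride(rng):
    """Build one fake taxi ride record (field names are hypothetical)."""
    return {
        "id": str(uuid.UUID(int=rng.getrandbits(128))),
        "passenger_count": rng.randint(1, 4),
        "trip_distance": round(rng.uniform(0.5, 20.0), 2),
        "created_at": datetime.now(timezone.utc).isoformat(),
    }


def produce(num_records, send, seed=None):
    """Generate num_records rides and pass each to send(key, value) as bytes."""
    rng = random.Random(seed)
    for _ in range(num_records):
        ride = make_ride(rng)
        send(ride["id"].encode("utf-8"), json.dumps(ride).encode("utf-8"))
    return num_records
```

Because each invocation generates its own records, running several invocations side by side scales the message volume without changing the code.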

April 12, 2023 | 26 min read | Data Streaming, Apache Kafka, AWS Glue Schema Registry, AWS Lambda, Kpow, Python

Glue Schema Registry provides a centralized repository for managing and validating schemas for topic message data. Its features can be utilized by many AWS services when building data streaming applications. In this post, we will discuss how to integrate Python Kafka producer and consumer apps in AWS Lambda with the Glue Schema Registry.

March 14, 2023 | 12 min read | Data Streaming, Simplify Streaming Ingestion on AWS, Amazon Athena, Amazon MSK, Apache Kafka, AWS, AWS Lambda, Python

Streaming ingestion from Kafka (MSK) into Redshift and Athena can be much simpler as they now support direct integration. In part 2, we discuss an end-to-end streaming ingestion solution using EventBridge, Lambda, MSK and Athena. We also use AWS SAM integrated with Terraform for developing the producer Lambda function locally.

February 8, 2023 | 18 min read | Data Streaming, Simplify Streaming Ingestion on AWS, Amazon MSK, Amazon Redshift, Apache Kafka, AWS, AWS Lambda, Python

Streaming ingestion from Kafka (MSK) into Redshift and Athena can be much simpler as they now support direct integration. In part 1, we discuss an end-to-end streaming ingestion solution using EventBridge, Lambda, MSK and Redshift. We also use AWS SAM integrated with Terraform for developing the producer Lambda function locally.

August 6, 2022 | 14 min read | Data Engineering, Apache Airflow, AWS, AWS Lambda, Docker, Python

We'll discuss the limitations of the Lambda invoke function operator of Apache Airflow and create a custom Lambda operator. The custom operator extends the existing one so that it reports the invocation result of a function correctly and records the exact error message on failure.

July 18, 2022 | 7 min read | Development, AWS, AWS Lambda, AWS SAM, Python, S3, Serverless Application Model (SAM)

We'll discuss how to build a serverless data processing application using the Serverless Application Model (SAM). We develop a Lambda function that is triggered whenever an object is created in an S3 bucket. The third-party packages needed for data processing are made available through Lambda layers.
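For orientation, a function triggered by S3 object creation typically starts by reading the bucket and key from the notification event. The sketch below shows that step only. The helper name `extract_objects` is illustrative, and the data processing itself is left out.

```python
from urllib.parse import unquote_plus


def extract_objects(event):
    """Return (bucket, key) pairs for each record in an S3 notification event."""
    objects = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys are URL-encoded in the event, with spaces sent as "+".
        key = unquote_plus(record["s3"]["object"]["key"])
        objects.append((bucket, key))
    return objects


def lambda_handler(event, context):
    objects = extract_objects(event)
    for bucket, key in objects:
        # Data processing would go here; this sketch only logs the location.
        print(f"s3://{bucket}/{key}")
    return {"objects": len(objects)}
```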

October 13, 2021 | 6 min read | Development, Amazon SQS, AWS, AWS Lambda, EventBridge, Node.js, Serverless Framework

Triggering a Lambda function with an EventBridge Events rule can serve as a serverless replacement for a cron job. Its highest frequency is one invocation per minute, however, so it cannot be used directly if you need to schedule a Lambda function more frequently. In this post, I’ll demonstrate another serverless solution for scheduling a Lambda function at a sub-minute frequency using Amazon SQS.
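One way to get below the one-minute floor, sketched here as an assumption rather than as the post's exact design, is to have the once-per-minute invocation enqueue several SQS messages with staggered delays, each of which then triggers the target function. The helper below (a hypothetical name) only computes those delays. Each value would be passed as the `DelaySeconds` of one message.

```python
def delay_offsets(interval_seconds, window_seconds=60):
    """Return the per-message delays that cover one scheduling window."""
    if interval_seconds <= 0 or interval_seconds > window_seconds:
        raise ValueError("interval_seconds must be between 1 and window_seconds")
    # One message per interval, starting immediately (delay 0).
    return list(range(0, window_seconds, interval_seconds))


if __name__ == "__main__":
    # A 10-second schedule needs six messages per minute.
    print(delay_offsets(10))
```

With a 10-second interval this yields delays of 0, 10, 20, 30, 40 and 50 seconds, so the target function runs six times within each minute.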

April 13, 2020 | 9 min read | Data Engineering, Apache Airflow, AWS, AWS Lambda, Docker, Python

In this post, I demonstrate how AWS Lambda can be integrated with Apache Airflow using a custom operator inspired by the ECS Operator.

July 20, 2019 | 6 min read | Development, AWS, AWS Lambda, Docker, Flask, LocalStack, Python

LocalStack provides an easy-to-use test/mocking framework for developing AWS applications. In this post, I'll demonstrate how to utilize LocalStack for development using a web service.

Real Time Streaming With Kafka and Flink - Lab 6 Consume Data From Kafka Using Lambda

Real Time Streaming With Kafka and Flink - Lab 1 Produce Data to Kafka Using Lambda

Integrate Glue Schema Registry With Your Python Kafka App

Simplify Streaming Ingestion on AWS – Part 2 MSK and Athena

Simplify Streaming Ingestion on AWS – Part 1 MSK and Redshift

Revisit AWS Lambda Invoke Function Operator of Apache Airflow

Serverless Application Model (SAM) for Data Professionals

Yet Another Serverless Solution for Invoking AWS Lambda at a Sub-Minute Frequency

Thoughts on Apache Airflow AWS Lambda Operator

AWS Local Development With LocalStack