Kafka Connect

Kafka Development on Kubernetes - Part 3 Kafka Connect

January 11, 2024 · 7 min read · Data Integration Data Streaming Kubernetes Kafka Development on Kubernetes Apache Kafka Docker Kafka Connect Kubernetes Minikube Python

Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka and other systems. In this post, we discuss how to set up a data ingestion pipeline using Kafka connectors. Fake customer and order data is ingested into Kafka topics using the MSK Data Generator. We also use the Confluent S3 sink connector to save the topic messages into an S3 bucket. The Kafka Connect servers and individual connectors are deployed using the custom resources of Strimzi on Kubernetes.
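As a rough illustration of what deploying a connector through a Strimzi custom resource can look like, the sketch below builds a `KafkaConnector` manifest as a Python dictionary and prints it as JSON, which `kubectl apply -f` also accepts. The connector name, the Kafka Connect cluster name, and the topic names are hypothetical placeholders, not values taken from the post.

```python
import json


def kafka_connector_manifest(name, cluster, connector_class, config, tasks_max=1):
    """Build a Strimzi KafkaConnector custom resource as a plain dict."""
    return {
        "apiVersion": "kafka.strimzi.io/v1beta2",
        "kind": "KafkaConnector",
        "metadata": {
            "name": name,
            # This label ties the connector to a KafkaConnect cluster resource.
            "labels": {"strimzi.io/cluster": cluster},
        },
        "spec": {
            "class": connector_class,
            "tasksMax": tasks_max,
            "config": config,
        },
    }


manifest = kafka_connector_manifest(
    name="order-sink",          # hypothetical connector name
    cluster="demo-connect",     # hypothetical KafkaConnect cluster name
    connector_class="io.confluent.connect.s3.S3SinkConnector",
    config={"topics": "order,customer"},  # hypothetical topic names
    tasks_max=2,
)

# Print the manifest so it can be saved to a file and applied with kubectl.
print(json.dumps(manifest, indent=2))
```

Keeping the manifest as data makes it easy to generate one resource per connector from a shared template.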

November 30, 2023 · 9 min read · Data Streaming Real Time Streaming With Kafka and Flink Amazon DynamoDB Apache Flink Apache Kafka AWS Kafka Connect Kpow

Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka and other systems. It makes it simple to quickly define connectors that move large collections of data into and out of Kafka. In this lab, we will discuss how to create a data pipeline that ingests data from a Kafka topic into a DynamoDB table using the Camel DynamoDB sink connector.

October 30, 2023 · 18 min read · Data Streaming Kafka Connect for AWS Services Integration Amazon MSK Amazon OpenSearch Service Apache Kafka AWS Kafka Connect Kpow

In the previous post, we discussed how to develop a data pipeline from Apache Kafka into OpenSearch locally using Docker. In this post, the pipeline will be deployed on AWS with Terraform, using Amazon MSK, Amazon MSK Connect and Amazon OpenSearch Service. First, we will deploy the infrastructure, which covers a VPC, VPN server, MSK cluster and OpenSearch domain. Then the Kafka source and sink connectors will be deployed on MSK Connect, followed by a quick data analysis.

October 23, 2023 · 12 min read · Data Integration Data Streaming Kafka Connect for AWS Services Integration Apache Kafka AWS Docker Kafka Connect Kpow OpenSearch

Kafka Connect can be an effective tool to ingest data from Apache Kafka into OpenSearch. In this post, we will discuss how to develop a data pipeline from Apache Kafka into OpenSearch locally using Docker; the pipeline will be deployed on AWS in the next post. Fake impressions and clicks data will be pushed into Kafka topics using a Kafka source connector, and those records will be ingested into OpenSearch indexes using a sink connector for near-real-time analytics.

July 3, 2023 · 14 min read · Data Integration Data Streaming Kafka Connect for AWS Services Integration Amazon DynamoDB Amazon MSK Apache Camel Apache Kafka AWS Kafka Connect Kpow

As part of investigating how to utilize Kafka Connect effectively for AWS services integration, I demonstrated how to develop the Camel DynamoDB sink connector using Docker in Part 2. Fake order data was generated using the MSK Data Generator source connector, and the sink connector was configured to consume the topic messages to ingest them into a DynamoDB table. In this post, I will illustrate how to deploy the data ingestion applications using Amazon MSK and MSK Connect.

June 15, 2023 · 12 min read · Data Integration Data Streaming Kafka Development With Docker Apache Kafka AWS AWS Glue Schema Registry Docker Kafka Connect Kpow

In Part 3, we developed a data ingestion pipeline using Kafka Connect source and sink connectors without enabling schemas. Later, in Part 5, we discussed the benefits of a schema registry when developing Kafka applications. In this post, I'll demonstrate how to enhance the existing data ingestion pipeline by integrating AWS Glue Schema Registry.

June 8, 2023 · 7 min read · Data Streaming Kafka Development With Docker Apache Kafka AWS AWS Glue Schema Registry Kafka Connect Schema Registry

The Glue Schema Registry supports features to manage and enforce schemas on data streaming applications using convenient integrations with Apache Kafka and other AWS managed services. To utilise those features, we need to use the client library. In this post, I'll first introduce how the client library integrates the Glue Schema Registry with Kafka producer and consumer apps, and then illustrate how to build it.

June 4, 2023 · 13 min read · Data Streaming Kafka Connect for AWS Services Integration Amazon DynamoDB Apache Camel Apache Kafka AWS Docker Kafka Connect

The suite of Apache Camel Kafka connectors and the Kinesis Kafka connector from AWS Labs can be effective for building data ingestion pipelines that integrate AWS services. In this post, I will illustrate how to develop the Camel DynamoDB sink connector using Docker. Fake order data will be generated using the MSK Data Generator source connector, and the sink connector will be configured to consume the topic messages and ingest them into a DynamoDB table.

May 25, 2023 · 9 min read · Data Integration Data Streaming Kafka Development With Docker Apache Kafka Docker Kafka Connect

Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka and other systems. In this post, I will illustrate how to set up a data ingestion pipeline using Kafka connectors. Fake customer and order data will be ingested into the corresponding topics using the MSK Data Generator source connector. The topic messages will then be saved into an S3 bucket using the Confluent S3 sink connector.
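For a Docker-based setup, connectors are commonly registered by posting a JSON payload to the Kafka Connect REST API. The sketch below only builds that request for an S3 sink connector and does not send it. The Connect URL, connector name, bucket, region, topic names and flush size are assumptions chosen for illustration, not the values used in the post.

```python
import json
import urllib.request


def build_register_request(connect_url, name, config):
    """Build a POST /connectors request for the Kafka Connect REST API."""
    body = json.dumps({"name": name, "config": config}).encode("utf-8")
    return urllib.request.Request(
        url=f"{connect_url}/connectors",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Illustrative S3 sink settings; every value here is a placeholder.
s3_sink_config = {
    "connector.class": "io.confluent.connect.s3.S3SinkConnector",
    "storage.class": "io.confluent.connect.s3.storage.S3Storage",
    "format.class": "io.confluent.connect.s3.format.json.JsonFormat",
    "tasks.max": "2",
    "topics": "order,customer",
    "s3.bucket.name": "my-demo-bucket",
    "s3.region": "ap-southeast-2",
    "flush.size": "100",
}

request = build_register_request(
    "http://localhost:8083",  # assumed local Kafka Connect endpoint
    "order-sink",             # hypothetical connector name
    s3_sink_config,
)

# Passing `request` to urllib.request.urlopen would submit it to a running worker.
print(request.get_method(), request.full_url)
```

Separating request construction from submission keeps the payload easy to inspect before a Connect worker is available.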

May 3, 2023 · 4 min read · Data Integration Data Streaming Kafka Connect for AWS Services Integration Amazon MSK Apache Kafka AWS Kafka Connect

Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka and other systems. It can be used to build real-time data pipelines on AWS effectively. In this post, I will introduce the Kafka connectors that are available, mainly for AWS services integration. Developing and deploying some of them will be covered in later posts.

Kafka Development on Kubernetes - Part 3 Kafka Connect

Real Time Streaming With Kafka and Flink - Lab 5 Write Data to DynamoDB Using Kafka Connect

Kafka Connect for AWS Services Integration - Part 5 Deploy Aiven OpenSearch Sink Connector

Kafka Connect for AWS Services Integration - Part 4 Develop Aiven OpenSearch Sink Connector

Kafka Connect for AWS Services Integration - Part 3 Deploy Camel DynamoDB Sink Connector

Kafka Development With Docker - Part 6 Kafka Connect With Glue Schema Registry

Kafka Development With Docker - Part 5 Glue Schema Registry

Kafka Connect for AWS Services Integration - Part 2 Develop Camel DynamoDB Sink Connector

Kafka Development With Docker - Part 3 Kafka Connect

Kafka Connect for AWS Services Integration - Part 1 Introduction