Docker

Use External Schema Registry With MSK Connect – Part 2 MSK Deployment

April 3, 20227 min read Data Integration Data Streaming Integrate Schema Registry With MSK Connect Amazon ECS Amazon MSK Apache Kafka Apicurio Registry AWS Change Data Capture (CDC)Debezium Docker Kafka Connect

We'll continue the discussion of a Change Data Capture (CDC) solution with a schema registry and its deployment to AWS. All major resources are deployed in private subnets and VPN is used to access them in order to improve developer experience. The Apicurio registry is used as the schema registry service and it is deployed as an ECS service. In order for the connectors to have access to the registry, the Confluent Avro Converter is packaged together with the connector sources. The post ends with illustrating how schema evolution is managed by the schema registry.

March 7, 202210 min read Data Integration Data Streaming Integrate Schema Registry With MSK Connect Apache Kafka Apicurio Registry AWS Change Data Capture (CDC)Debezium Docker Kafka Connect

We'll discuss a Change Data Capture (CDC) architecture with a schema registry. As a starting point, a local development environment is set up using Docker Compose. The Debezium and Confluent S3 connectors are deployed with the Confluent Avro converter and the Apicurio registry is used as the schema registry service. A quick example is shown to illustrate how schema evolution can be managed by the schema registry.

November 14, 20218 min read Data Engineering AWS AWS Glue Docker PySpark Python

Recently AWS Glue 3.0 was released but a docker image for this version is not published. In this post, I’ll illustrate how to create a development environment for AWS Glue 3.0 (and later versions) by building a custom docker image.

August 20, 20219 min read Data Engineering Apache Spark AWS AWS Glue Docker PySpark Python

In this post, I'll demonstrate how to build development environments for AWS Glue 1.0 and 2.0 using the Docker image and the Visual Studio Code Remote - Containers extension.

April 13, 20209 min read Data Engineering Apache Airflow AWS AWS Lambda Docker Python

In this post, it is demonstrated how AWS Lambda can be integrated with Apache Airflow using a custom operator inspired by the ECS Operator.

November 29, 20199 min read Development Docker FastAPI Python R Traefik

Traefik is a modern HTTP reverse proxy and load balancer. In this post, it'll be demonstrated how path-based routing can be set up by Traefik with Docker. Also a centralized authentication will be illustrated with the Forward Authentication feature of Traefik.

November 1, 201912 min read Development Docker Kubernetes Minikube Python R WSL

In this post, I'll demonstrate how to create a Linux development environment on Windows using WSL. Also an example app (Rserve web service with a sidecar container) on Minikube will be demonstrated.

July 20, 20196 min read Development AWS AWS Lambda Docker Flask LocalStack Python

LocalStack provides an easy-to-use test/mocking framework for developing AWS applications. In this post, I'll demonstrate how to utilize LocalStack for development using a web service.

July 19, 20195 min read Development Cronicle Docker

Cronicle is a multi-server task scheduler and runner. In this post, multi-server configuration of Cronicle will be demonstrated with Docker and Nginx as load balancer.

November 19, 20176 min read Development API Development With R Docker Plumber R RApache Rserve

In part I, it is discussed how to serve an R function with plumber, Rserve and rApache. In this post, the APIs are deployed in a Docker container and, after showing example requests, their performance is compared.

Use External Schema Registry With MSK Connect – Part 2 MSK Deployment

Use External Schema Registry With MSK Connect – Part 1 Local Development

Local Development of AWS Glue 3.0 and Later

AWS Glue Local Development With Docker and Visual Studio Code

Thoughts on Apache Airflow AWS Lambda Operator

Dynamic Routing and Centralized Auth With Traefik, Python and R Example

Linux Dev Environment on Windows

AWS Local Development With LocalStack

Cronicle Multi Server Setup

API Development With R Part II