Jaehyeon Kim
Archives

2025

7 posts in total
  • Kafka Clients With Avro - Schema Registry and Order Events May 27
  • Kafka Clients With JSON - Producing and Consuming Order Events May 20
  • Meet the Streamhouse Trio - Paimon, Fluss, and Iceberg for Unified Data Architectures May 6
  • Run Flink SQL Cookbook in Docker Apr 15
  • Realtime Dashboard With FastAPI, Streamlit and Next.js - Part 3 Next.js Dashboard Mar 4

2024

29 posts in total
  • Apache Beam Python Examples - Part 10 Develop Streaming File Reader Using Splittable DoFn Dec 19
  • Apache Beam Python Examples - Part 9 Develop Batch File Reader and PiSampler Using Splittable DoFn Dec 5
  • Apache Beam Python Examples - Part 8 Enhance Sport Activity Tracker With Runner Motivation Nov 21
  • Change Data Capture (CDC) Local Development With PostgreSQL, Debezium Server and Pub/Sub Emulator Nov 7
  • Apache Beam Python Examples - Part 7 Separate Droppable Data Into Side Output Oct 24
  • ...

2023

39 posts in total
  • Kafka Development on Kubernetes - Part 1 Cluster Setup Dec 21
  • Real Time Streaming With Kafka and Flink - Lab 6 Consume Data From Kafka Using Lambda Dec 14
  • Setup Local Development Environment for Apache Flink and Spark Using EMR Container Images Dec 7
  • Real Time Streaming With Kafka and Flink - Lab 5 Write Data to DynamoDB Using Kafka Connect Nov 30
  • Real Time Streaming With Kafka and Flink - Lab 4 Clean, Aggregate, and Enrich Events With Flink Nov 23
  • ...

2022

15 posts in total
  • Data Build Tool (Dbt) for Effective Data Transformation on AWS – Part 5 Athena Dec 6
  • Data Build Tool (Dbt) for Effective Data Transformation on AWS – Part 4 EMR on EKS Nov 1
  • Data Build Tool (Dbt) for Effective Data Transformation on AWS – Part 3 EMR on EC2 Oct 19
  • Data Build Tool (Dbt) for Effective Data Transformation on AWS – Part 2 Glue Oct 9
  • Data Build Tool (Dbt) for Effective Data Transformation on AWS – Part 1 Redshift Sep 28
  • ...

2021

7 posts in total
  • Data Lake Demo Using Change Data Capture (CDC) on AWS – Part 3 Implement Data Lake Dec 19
  • Data Lake Demo Using Change Data Capture (CDC) on AWS – Part 2 Implement CDC Dec 12
  • Data Lake Demo Using Change Data Capture (CDC) on AWS – Part 1 Local Development Dec 5
  • Local Development of AWS Glue 3.0 and Later Nov 14
  • Yet Another Serverless Solution for Invoking AWS Lambda at a Sub-Minute Frequency Oct 13
  • ...

2020

1 post in total
  • Thoughts on Apache Airflow AWS Lambda Operator Apr 13

2019

5 posts in total
  • Dynamic Routing and Centralized Auth With Traefik, Python and R Example Nov 29
  • Distributed Task Queue With Python and R Example Nov 15
  • Linux Dev Environment on Windows Nov 1
  • AWS Local Development With LocalStack Jul 20
  • Cronicle Multi Server Setup Jul 19

2018

2 posts in total
  • Shiny to Vue.js May 26
  • Async Shiny and Its Limitation May 19

2017

6 posts in total
  • API Development With R Part II Nov 19
  • API Development With R Part I Nov 18
  • Serverless Data Product POC Backend Part IV - Serving R ML Model via S3 Apr 17
  • Serverless Data Product POC Backend Part III - Exposing R ML Model via APIG Apr 13
  • Serverless Data Product POC Backend Part II - Deploying R ML Model via Lambda Apr 11
  • ...

2016

6 posts in total
  • Some Thoughts on Shiny Open Source - Render Multiple Pages Jun 27
  • Some Thoughts on Shiny Open Source - Internal Load Balancing May 23
  • Asynchronous Processing Using Job Queue May 12
  • Boost SparkR With Hive Apr 30
  • Quick Start SparkR in Local and Cluster Mode Mar 2
  • ...

2015

15 posts in total
  • Quick Test to Wrap Python in R Nov 21
  • Some Thoughts on Python for R Users Aug 9
  • Some Thoughts on Python Aug 8
  • Setup Random Seeds on Caret Package May 30
  • Packaging Analysis Mar 24
  • ...

2014

5 posts in total
  • Looping Without For Dec 17
  • Short R Examples Dec 3
  • Summarise Stock Returns From Multiple Files Nov 27
  • Download Stock Data - Part II Nov 21
  • Download Stock Data - Part I Nov 20
Developer Experience at Factor House | Technical Content Creator

Copyright © 2023-2025 Jaehyeon Kim. All Rights Reserved.