Jaehyeon Kim
Jaehyeon Kim

  • Blog
    • Archives

    • Series

      List of series.

    • Categories

      List of categories.

    • Tags

      List of tags.


/

  • Github Linkedin Paypal RSS

  • Font Size
  • Palette
  • Mode

Some Thoughts on Shiny Open Source - Render Multiple Pages

June 27, 20165 min read DevelopmentRR Shiny

In this post, a simple way of internal load balancing is demonstrated by redirecting multiple same applications, depending on the number of processes binded to them

Read More

Some Thoughts on Shiny Open Source - Internal Load Balancing

May 23, 20165 min read DevelopmentRR Shiny

In this post, a simple way of internal load balancing is demonstrated by redirecting multiple same applications, depending on the number of processes binded to them

Read More

Asynchronous Processing Using Job Queue

May 12, 20163 min read DevelopmentR

In this post, a way to overcome one of R's limitations of lack of multi-threading is discussed by job queuing using the jobqueue package

Read More

Boost SparkR With Hive

April 30, 20168 min read Data EngineeringApache HiveApache SparkHiveQLRSparkR

One option to boost SparkR's performance as a data processing engine is manipulating data in Hive Context rather than in limited SQL Context. In this post, we discuss how to run SparkR in Hive Context.

Read More

Quick Start SparkR in Local and Cluster Mode

March 2, 20168 min read Data EngineeringApache SparkRSparkR

In this post, we discuss how to execute SparkR in a local and cluster mode.

Read More

Spark Cluster Setup on VirtualBox

February 22, 20163 min read Data EngineeringApache SparkRSparkR

We discuss how to set up a Spark cluser between 2 Ubuntu guests. Firstly it begins with machine preparation.

Read More

Quick Test to Wrap Python in R

November 21, 20154 min read DevelopmentPythonR

We discuss how to make use of Python outcomes in R using a package.

Read More

Some Thoughts on Python for R Users

August 9, 20155 min read DevelopmentPythonR

An article that motivates the benefits of Python for R users.

Read More

Some Thoughts on Python

August 8, 20153 min read DevelopmentPythonR

Introduction to Python

Read More

Setup Random Seeds on Caret Package

May 30, 20156 min read Data AnalysisR

Setting up random seed is important for reproducibility of analysis. In this post, we discuss how to generate random seed using the caret package.

Read More
  • ««
  • «
  • 10
  • 11
  • 12
  • 13
  • 14
  • »
  • »»
Profile
Jaehyeon Kim
Jaehyeon Kim
Developer Experience at Factor House | Technical Content Creator
Taxonomies
Data Streaming 62 Data Engineering 30 Development 27 Data Analysis 17 Data Integration 12 Kubernetes 5 Security 5 Data Processing 3 Data Architecture 2
Python 66 Apache Kafka 57 AWS 50 Docker 46 R 37 Apache Flink 28 Apache Beam 17 Kafka Connect 15 Amazon MSK 14 AWS Lambda 14 Apache Spark 13 Dbt 13 Amazon EMR 11 Kubernetes 8 Pyflink 8 Change Data Capture (CDC) 6 Debezium 6 Amazon DynamoDB 5 Apache Airflow 5 PostgreSQL 5 PySpark 5 Amazon API Gateway 4 Amazon Athena 4 AWS Glue 4 AWS Glue Schema Registry 4 BigQuery 4 Minikube 4 R Shiny 4 RServe 4 Amazon EKS 3 Amazon QuickSight 3 Apache Hudi 3 EMR on EKS 3 FastAPI 3 GCP 3 GRPC 3 SparkR 3 WebSocket 3 Amazon Redshift 2 Amazon S3 2 ALL 105
Kafka Development With Docker 11 Apache Beam Python Examples 10 Real Time Streaming With Kafka and Flink 7 DBT Pizza Shop Demo 6 Tree Based Methods in R 6 Apache Beam Local Development With Python 5 DBT for Effective Data Transformation on AWS 5 Kafka Connect for AWS Services Integration 5 Serverless Data Product 4 Data Lake Demo Using Change Data Capture 3 Getting Started With Pyflink on AWS 3 Kafka Development on Kubernetes 3 Parallel Processing on Single Machine 3 Realtime Dashboard With FastAPI, Streamlit and Next.js 3 API Development With R 2 DBT Guide for Production 2 Deploy Python Stream Processing App on Kubernetes 2 Integrate Schema Registry With MSK Connect 2 Kafka, Flink and DynamoDB for Real Time Fraud Detection 2 Simplify Streaming Ingestion on AWS 2 ALL 21
2025 6 2024 29 2023 39 2022 15 2021 7 2020 1 2019 5 2018 2 2017 6 2016 6 2015 15 2014 5
Posts
  • featured.png
    Meet the Streamhouse Trio - Paimon, Fluss, and Iceberg for Unified Data Architectures
    May 6, 2025
  • featured.gif
    Run Flink SQL Cookbook in Docker
    April 15, 2025
  • featured.gif
    Realtime Dashboard With FastAPI, Streamlit and Next.js - Part 1 Data Producer
    February 18, 2025
  • featured.png
    Change Data Capture (CDC) Local Development With PostgreSQL, Debezium Server and Pub/Sub Emulator
    November 7, 2024
  • featured.png
    Guide to Running DBT in Production
    September 13, 2024
  • featured.png
    DBT CI/CD Demo With BigQuery and GitHub Actions
    September 5, 2024
  • featured.png
    Cache Data on Apache Beam Pipelines Using a Shared Object
    August 22, 2024
  • featured.png
    Apache Beam Python Examples - Part 1 Calculate K Most Frequent Words and Max Word Length
    July 4, 2024
  • featured.png
    Deploy Python Stream Processing App on Kubernetes - Part 1 PyFlink Application
    May 30, 2024
  • featured.png
    Apache Beam Local Development With Python - Part 1 Pipeline, Notebook, SQL and DataFrame
    March 28, 2024
  • featured.png
    Kafka Clients With JSON - Producing and Consuming Order Events
    May 20, 2025
  • featured.png
    Meet the Streamhouse Trio - Paimon, Fluss, and Iceberg for Unified Data Architectures
    May 6, 2025
  • featured.gif
    Run Flink SQL Cookbook in Docker
    April 15, 2025
  • featured.gif
    Realtime Dashboard With FastAPI, Streamlit and Next.js - Part 3 Next.js Dashboard
    March 4, 2025
  • featured.gif
    Realtime Dashboard With FastAPI, Streamlit and Next.js - Part 2 Streamlit Dashboard
    February 25, 2025
  • featured.gif
    Realtime Dashboard With FastAPI, Streamlit and Next.js - Part 1 Data Producer
    February 18, 2025
  • featured.png
    Apache Beam Python Examples - Part 10 Develop Streaming File Reader Using Splittable DoFn
    December 19, 2024
  • featured.png
    Apache Beam Python Examples - Part 9 Develop Batch File Reader and PiSampler Using Splittable DoFn
    December 5, 2024
  • featured.png
    Apache Beam Python Examples - Part 8 Enhance Sport Activity Tracker With Runner Motivation
    November 21, 2024
  • featured.png
    Change Data Capture (CDC) Local Development With PostgreSQL, Debezium Server and Pub/Sub Emulator
    November 7, 2024
Actions
Go back Reload Copy URL

Jaehyeon Kim

Developer Experience at Factor House | Technical Content Creator

Copyright © 2023-2025 Jaehyeon Kim. All Rights Reserved.