R - Categories - Jaehyeon Kim

Asynchronous Processing Using Job Queue

May 12, 20163 min read R R

In this post, a way to overcome one of R's limitations of lack of multi-threading is discussed by job queuing using the jobqueue package

Boost SparkR With Hive

April 30, 20168 min read R Apache Hive Apache Spark R SparkR

One option to boost SparkR's performance as a data processing engine is manipulating data in Hive Context rather than in limited SQL Context. In this post, we discuss how to run SparkR in Hive Context.

Quick Start SparkR in Local and Cluster Mode

March 2, 20168 min read R Apache Spark R SparkR

In this post, we discuss how to execute SparkR in a local and cluster mode.

Spark Cluster Setup on VirtualBox

February 22, 20163 min read R Apache Spark R SparkR VirtualBox

We discuss how to set up a Spark cluser between 2 Ubuntu guests. Firstly it begins with machine preparation.

Quick Test to Wrap Python in R

November 21, 20154 min read R Python R

We discuss how to make use of Python outcomes in R using a package.

Packaging Analysis

March 24, 20158 min read R R

We discuss how to turn analysis into an R package.

Parallel Processing on Single Machine - Part III

March 19, 20158 min read R Parallel Processing on Single Machine R

Part II that demonstrates how to implement parallem processing on single machine in R

Parallel Processing on Single Machine - Part II

March 17, 20156 min read R Parallel Processing on Single Machine R

Part III that demonstrates how to implement parallem processing on single machine in R

Parallel Processing on Single Machine - Part I

March 14, 20157 min read R Parallel Processing on Single Machine R

Part I that demonstrates how to implement parallem processing on single machine in R

Quick Trial of Adding Column

January 14, 20152 min read R R

This is a quick trial of adding overall and conditional (by user) average columns in a data frame.