Skip to content

jseidman/hadoop-R

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 

Repository files navigation

Examples of integrating Hadoop and R. This directory contains the following:

airline/

Examples which use the flight arrival and departure data available here: 
 
  http://stat-computing.org/dataexpo/2009/the-data.html

Note that this is the same data set used for many of the examples in the RHIPE documentation. 

The following examples are in this directory:

airline/src/deptdelay_by_month/R/streaming/ - Example that uses the Hadoop streaming MapReduce interface to calculate average departure delay by month for each airline.

airline/src/deptdelay_by_month/R/hive - Example using Hadoop Interactive for running MapReduce code to calculate average departure delay by month for each airline.

airline/src/deptdelay_by_month/R/rhipe - Example using RHIPE to run MapReduce code that calculates average departure delay by month for each airline and then visualize the results.

airline/src/deptdelay_by_month/R/rmr - Example using Revolution Analytics rmr package to calculate average departure delay by month for each airline.

Instructions for running the code can be found with each example.

About

Example code for running R on Hadoop

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages