Skip to content

Using Ruby and Mandy (Flight and Nasdaq datasets)

Seppala edited this page Jul 8, 2011 · 1 revision

1. Flight Prices and Most volatile stocks in NASDAQ

https://github.com/wibblz/Stooge-Reduce

(Hack/Reduce 2 Toronto)

Gar Liu (@lonelydatum), Nathan Rambarran (@wibblz), Khurram Virani (@viranik)

Gar, Nathan and Khurram first wanted to figure out if oil prices affected flight prices. They took oil company stock prices and the average price from all the flights in the dataset. However, the flight dataset was limited and the results ambiguous, so the team changed direction. Next, they wanted to calculate which stocks were the most volatile in the NYSE data. They created a scoring algorithm to determine which stocks are the most volatile.

Team 3 used http://mandyruby.pbworks.com/w/page/1263688/FrontPage, an easy library to use Hadoop with Ruby. They also tried out Wukong, and don’t recommend that. Mandy worked very well though.