Skip to content

mateiz/shark

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Shark (Hive on Spark)

Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 100 times faster than Hive without modification to the existing data nor queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.

Shark 0.8.0 requires:

  • Scala 2.9.3
  • Hive 0.9
  • Spark 0.8.x
  • OpenJDK 7 or Oracle HotSpot JDK 7 or Oracle HotSpot JDK 6u23+

For current documentation, see the Shark Project Wiki

Releases

No releases published

Packages

No packages published

Languages

  • Scala 87.1%
  • Java 6.2%
  • Shell 6.2%
  • Other 0.5%