Popular repositories
- flink-crawler Public
Continuous scalable web crawler built on top of Flink and crawler-commons
- cascading.avro Public Forked from clizzin/cascading.avro
Cascading Scheme for the Apache Avro data serialization format
Repositories
Showing 10 of 32 repositories
- pinot Public Forked from apache/pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
- flink-crawler-ccdemo Public
Demo of using flink-crawler to extract pages from Common Crawl for a target language
- cascading Public Forked from Cascading/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on various cluster computing platforms. Please see https://github.com/cwensel/cascading for access to all WIP branches.
- fastText Public Forked from facebookresearch/fastText
Library for fast text representation and classification.