A cutting-edge big data initiative aimed at creating a real-time data pipeline to analyze the popularity and sentiments of trending topics on Twitter.
Updated Jul 24, 2023 - Scala
A Docker container with a ready-to-use Apache Drill installation
Apache Drill UDFs for retrieving and working with HTML text
Apache Drill and Apache ZooKeeper Helm charts for Kubernetes
Apache Drill dialect for SQLAlchemy
Images for creating a four-node Drill cluster with HDFS support
XML Plugin for Apache Drill
This application is a proof of concept. I have always wondered how data mining could be done in Java without using an API directly. I also wanted to learn more about MongoDB and NoSQL. The application takes a ticker symbol from the user and scrapes data from the following URL: http://finance.yahoo.com/quote/${ticker…
Explore data virtualization and query performance optimization with Apache Drill, Hive, and Impala. Tasks include comparing virtualization precision, proposing solutions for a bookstore's diverse data formats, creating Impala databases, and addressing query performance issues. The report offers practical insights and commands for implementation.
drill, kafka, spark, elasticsearch, kotlin dataframe
Apache-drill-docker-image
Node.js client for Apache Drill
Popular applications and Franks! built on top of the Frank!Framework, ready to launch on Kubernetes using Helm.
Apache Drill plugin for LTSV (Labeled Tab-separated Values) files
A collection of UDFs for Apache Drill that implement common cryptographic functions.
Benchmark of different solutions to read from HDFS in real time