Working examples for some components on GCP, and instructions on how to run them.
-
Updated
Apr 26, 2017 - Java
Working examples for some components on GCP, and instructions on how to run them.
Determination of which words occur in a dataset of textbooks along with each word's occurrence count identification with the help of Google Cloud Platform based Dataproc cluster formation.
Collection of personal resources on Google Cloud
Google DataProc Spark Scala Job for MNIST Handwritten Digit Recognition using Decision Trees (Spark MLlib)
Collected data about from three sources, one opinion-based social media in twitter, research data in New York Times, and the third is the common crawl data for the same topic or key phrase, and from similar time periods. Processed the three data sets collected individually using classical big data methods like Map Reduce in Google Dataproc Clust…
Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc
Dataproc Customisable HA cluster debian-9 with zookeeper,kafka ,BigQuery and other tools/jobs with Terraform
Data Workflows with GCP Dataproc, Apache Airflow and Apache Spark
gke with terraform, dataproc with terraform
Running a wordcount job on a Google Dataproc cluster
Big data analysis of 'shared-world' cloud application.
Cloud application to promote responsible tourism and help prevent overtourism.
Using PySpark for Tensorflow model inferencing on GCP Dataproc Cluster. Demo for PyCon Hong Kong Fall 2020 Presentation
Add a description, image, and links to the dataproc topic page so that developers can more easily learn about it.
To associate your repository with the dataproc topic, visit your repo's landing page and select "manage topics."