Registration request (expired)
Assignment (Pandas with Youtube stat data):
-
Basic Webpage Scraping -- Note: File 'simple_page.html' will be uploaded to Colab automatically.
-
Twitter Data Extraction -- Note: Do not forget to upload "twitter.yml" to your colab machine and modify the bearer token value.
-
Selenium -- Note: this example cannot be run on Colab.
This section contains example for Kafka. To test, you can use Kafka Server using IP 35.240.149.229 port 9092 or local server.
To run local server, install kafka locally or use the following docker compose file
-
Sample AVRO Schema sample.avsc
-
Basic Spark -- Note: You must upload file 'star-wars.txt' to your colab drive.
-
Spark SQL -- Note: You must upload file 'bank-additional-full.txt' to your colab drive.
-
Spark ML -- Note: You must upload file 'bank-additional-full.txt' to your colab drive.
-
Assignment -- Note: You must upload file 'netflix-rotten-tomatoes-metacritic-imdb.txt' to your colab drive.
All codes are intended to be run in an airflow environment (not on the colab). See airflow code folder for more details.
All codes are intended to be run in a python environment (not on the colab). See FastAPI code folder for more details.
- Assignment visualize netscience.gml.