Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
-
Updated
May 1, 2024 - Jupyter Notebook
Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
A movie recommender written in Go that suggests movies considering various factors within a particular dataset, encompassing users, movies, and movie ratings.
Analyzing classified ads data from the used motorcycles market. Tasks involve utilizing Redis Bitmaps for analytics on seller actions and MongoDB for analyzing bike listings. Includes data installation, cleaning, and analysis.
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
A summative coursework for CSC8101 Engineering for AI
The 2022 Big Data Bowl data contains Next Gen Stats player tracking, play, game, player, and PFF scouting data for all 2018-2020 Special Teams play. Here, you'll find a summary of each data set in the 2022 Data Bowl, a list of key variables to join on, and a description of each variable.
Welcome, feel free to navigate through my project. Detail information about each project can be found inside specified directory.
Data Science Assignment file
rock-solid pillars for enterprise-grade solutions
Big Data and AI Engineering bootcamp 2nd capstone project. Using Big Data Tools to predict the probability of university enrollment for Egypt's High School students. 🏫 📚 🔬
This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres
excel, markdown, csv, sql 数据源批量/单独格式互相转换
Degree diploma project
Study of French hospital production. (2021)
Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.
Reservoir Sampling for Group-By Queries in Flink Platform. Answering effectively Single Aggregate.
Analysis of Ethereum Transactions and Smart Contracts
Crack Detection model using yolov7
MapReduce Job Development, RDDs Programming, Medical Data Management, Sales Analysis, And Efficient Data Integration For Big Data Analysis. Spark: Big Data Processing, SQOOP Integration, And Spark Structured Streaming For Real-Time Data.
Add a description, image, and links to the big-data-processing topic page so that developers can more easily learn about it.
To associate your repository with the big-data-processing topic, visit your repo's landing page and select "manage topics."