Big-Data Platform

This project will contains some code exaplains for both implementing infrastructure for big data based projects and well the application part, the source code aims to show how to implement different big data layers based on different big data technologies (hadoop, mongodb...).

It is structured as follow :

Engineering : contains applications and notebooks
- Notebooks : explorations and PoC
- Applications : PoC, MVPs and production ready
Platform : contains scripts, docker images and applicaiton for platform management
- Infrastructure : Scripts for platform installation, patch setup, and tools deployment.
- Inventories : platform hosts and credentials, and ansible inventories
- Managmeent : Platform management scripts (services control, platform initialization...)
- Services : Platform applications for data ingestion, integration, and egress. or specific application configuration and tunning The source code uses :
docker : to create and run on the fly a iso-prod environments, servers and applications
ansible and Bash scripts : to automate applications deployment and platform configuration, but also to interact and execute others scripts (hive, sqoop, pig, spark-submit)
sqoop : to ingest and egress data from and to Relational data bases
spark : as a distributed computation framework
hadoop : as a complete data-lake and analytics eco-system
java and scala : as the main projects languages
Hive, Impala :
Pig :
Kafka :

Todo :

Code Cleaning and Refactoring according to JIRA 19282 case
Project Structure change according to JIRA 19287 case
move command_prompt from .sh bash files to ansible
automate patchs and features deployment

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Engineering		Engineering
Platform		Platform
.gitattribut		.gitattribut
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
_config.yml		_config.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Engineering

Engineering

Platform

Platform

.gitattribut

.gitattribut

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

_config.yml

_config.yml

Repository files navigation

Big-Data Platform

About

Releases

Packages

Languages

License

MehdiTAZI/BigData-Platform

Folders and files

Latest commit

History

Repository files navigation

Big-Data Platform

About

Topics

Resources

License

Stars

Watchers

Forks

Languages