GitHub - FINRAOS/herd: Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabytes of data and make it accessible for data processing and analytical purposes by any cloud compute platform.

Overview

Herd is big data governance for the cloud. The herd unified data catalog helps separate compute from storage in the cloud. Herd job orchestration manages your ETL and analytics processes while tracking all data in the catalog. Here is a quick summary of features:

Unified Data Catalog A centralized, auditable catalog for operational usage and data governance.
Track Lineage Capture data ancestry for regulatory, forensic, and analytical purposes
Manage Clusters Create and launch clusters; load data into clusters from catalog entries
Orchestrate Jobs Orchestrate clusters and catalog services to automate processing jobs

Find out more about herd features on our GitHub project page

Quick Start

The best way to start learning about herd is through these links. The demo installation process is quick and easy - you can have herd up and running in AWS in 10-15 minutes and start registering data immediately afterwards.

Get Involved

We are actively seeking organizations and individuals that are interested in adopting herd and contributing to the development effort. Find out more in the contributions section of our GitHub project page. If you have any questions or discussion topics, post them on GitHub Issues or email us at herd@finra.org.

License

Herd is licensed under Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 4,554 Commits
herd-build		herd-build
herd-code		herd-code
.editorconfig		.editorconfig
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
build_ci.sh		build_ci.sh
pom.xml		pom.xml
update-version.sh		update-version.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

herd-build

herd-build

herd-code

herd-code

.editorconfig

.editorconfig

.gitignore

.gitignore

.travis.yml

.travis.yml

LICENSE

LICENSE

README.md

README.md

build_ci.sh

build_ci.sh

pom.xml

pom.xml

update-version.sh

update-version.sh

Repository files navigation

Overview

Quick Start

Get Involved

License

About

Releases 15

Packages

Contributors 25

Languages

License

FINRAOS/herd

Folders and files

Latest commit

History

Repository files navigation

Overview

Quick Start

Get Involved

License

About

Resources

License

Stars

Watchers

Forks

Languages