Elastic Stack Udemy Course Notes

Course URL: https://www.udemy.com/course/elasticsearch-complete-guide/learn/lecture/7373340?start=0#overview

All images taken directly from Bo Andersen's lecture slides.

Action	Date
Started Course	8/29/22
Completed Lectures 1-X	8/29/22

Lecture 3: Elastic Stack:

Elastic Search
Kibana
Logstash: A data processing pipeline. It receives events, processes/filters them, and sends them to one or more platforms. Defined in a proprietary markup language.
X-Pack: Adds additional features to the Elasticsearch & kibana. The most important of these features include:
- Security: Adds authentication and authorization to elasticsearch and kibana. Controls user permissions. Different people might need different privileges.
- Monitoring: Gain insight into how the elastic stack is running.
- Alerting: Check if CPU usage is too high, or errors starting propagating.
- Reporting
- Machine Learning: Abnormality Detection, Forecasting, etc...
- Graphs: Analyses relationships between data.
- SQL: Elasticsearch queries are written in Query DSL. It is flexible but verbose.
Beats

Summary of Elastic Stack

The center of it all is elastic search, which contains the data. Injecting data into elastic search can be done with beats or elastic stash, as well as through elasticsearch's api. Kibana is a UI that sits on top of elasticsearch to let you visualize the data that you receive. X-pack enables additional features such as ML.

ELK Stack = Elasticsearch + Logstash + Kibana This term originates from before X-pack existed. The elastic stack is a superset of the ELK stack.

Lecture 4: Walk-through of common architectures

Suppose we have an E-commerce app. Our data is stored in a relational db such as postgres.

We want to improve the search functionality of this app. So far it has been directly using the postgres db to get search info, but this is inefficient and not what dbs are for. Elasticsearch is much better for this. When a user types in a search from the e-commerce frontend, the request is sent directly to elasticsearch. This can be done with an HTTP req, however:

But how do we get data into elasticsearch in the first place? And how do we keep it updated? We will do this with data duplication from our postgres db. If a user adds a new product though, how do we get it into elasticsearch? You will need to write a script that imports that data. From the moment the data is imported elasticsearch will keep it updated. This is the simplistic usage of elasticsearch.

Now, what if we want to implement a UI for elasticsearch? We would use kibana for this.

We will need to spin up a dedicated server to run kibana and configure it with elasticsearch. Overtime, if web traffic increases, the server will start to sweat. We will need to monitor server resources with Metricbeat by installing it on the kibana server. But how does the Metricbeat data get into elasticsearch? We can simply configure metricbeat to send data to elasticsearch via an ingest node. The details of this is not important right now. Now that system metrics are being stored in elastic search we can visualize the metric data in kibana. Metricbeat has a default dashboard within kibana.

We want to monitor access logs and error logs now too. We can even check response times for each endpoint. This allows us to identify bad deployments. Filebeat can be used for this task.

Next, fast forward 6 months. We have added a ton more code, functionality, and products to the e-commerce app. We will want to wire up Logstash for event processing.

Lecture 10: Understanding the basics, ARCHITECTURE

When we started up elasticsearch, what actually happened is that we started up a node.

Node: An instance of elasticsearch that stores data. To ensure we can store many terabytes of data, we can spin up as many nodes as we want. Each node will then store a part of our data. You can run any number of nodes on a single machine. In development it is not a huge deal to have many nodes on a single machine, but in prod you should have one node per server or container.

Each node belongs to what is called a Cluster.

Cluster: A collection of related nodes that together, contain all of our data. We can have many clusters if we want, but one is usually enough. It is possible to perform cross cluster searches, but it is not common.

When a node starts up, it will automatically create its own cluster, or join a cluster that is already running. There are problems with only having 1 node...

But how is data organized and stored? Each unit of data that you store within a cluster is called a document. Documents are JSON objects containing whatever information you want to store. When you index a document, the original json document that you sent to elasticsearch, is stored along with metadata.

Every document is stored with an index. An index groups documents together logically, as well as provides configuration objects.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
course-diagrams		course-diagrams
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

course-diagrams

course-diagrams

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Elastic Stack Udemy Course Notes

Lecture 3: Elastic Stack:

Summary of Elastic Stack

Lecture 4: Walk-through of common architectures

Lecture 10: Understanding the basics, ARCHITECTURE

About

DylanMorison/elastic-search-tutorial

Folders and files

Latest commit

History

course-diagrams

course-diagrams

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Elastic Stack Udemy Course Notes

Lecture 3: Elastic Stack:

Summary of Elastic Stack

Lecture 4: Walk-through of common architectures

Lecture 10: Understanding the basics, ARCHITECTURE

About

Topics

Resources

Stars

Watchers

Forks