Skip to content

This repository is a curated collection of valuable monitoring tools, resources, and best practices for developers, sysadmins, and DevOps professionals. It covers various aspects of monitoring, including infrastructure, applications, logs, networks, cloud, and Kubernetes.

License

awcodify/awesome-monitoring

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Awesome Monitoring

Welcome to the Awesome Monitoring repository! This collection aims to curate a list of valuable monitoring tools, resources, and best practices for developers, sysadmins, and DevOps professionals.

Categories

Infrastructure Monitoring

Infrastructure monitoring tools are designed to monitor the health and performance of servers, networks, and other critical components of an IT infrastructure. These tools help in identifying issues, analyzing resource utilization, and ensuring the overall stability and reliability of the infrastructure.

Tools

  • Prometheus: An open-source monitoring and alerting toolkit designed for reliability and scalability. It provides time-series data collection, querying, and alerting capabilities, making it a popular choice for monitoring distributed systems.

  • Grafana: A leading open-source analytics and monitoring platform. Grafana allows users to visualize and analyze data from various sources, including Prometheus, Elasticsearch, InfluxDB, and more.

Resources

  • Prometheus Documentation: The official documentation for Prometheus, providing in-depth guides on how to set up, configure, and use Prometheus for monitoring.

  • Grafana Tutorials: A collection of tutorials and guides for using Grafana effectively, covering various aspects of data visualization and analysis.

Application Monitoring

Application monitoring tools focus on tracking the performance and health of applications and services. They help identify bottlenecks, errors, and potential improvements to ensure the smooth functioning of applications.

Tools

  • Datadog: A cloud monitoring platform that offers full-stack observability. Datadog provides comprehensive monitoring, metrics, traces, and logs for infrastructure, applications, and services.

  • New Relic: A monitoring and observability platform that provides real-time insights into the performance and health of applications, servers, and infrastructure.

Resources

  • New Relic University: An online learning platform with courses, tutorials, and documentation on using New Relic for application monitoring and performance management.

Log Management

Log management tools centralize, analyze, and visualize logs generated by various applications and services. These tools help in monitoring system activities, troubleshooting issues, and maintaining compliance.

Tools

  • ELK Stack: A combination of Elasticsearch, Logstash, and Kibana used for centralized log management and analysis. Elasticsearch stores and indexes logs, Logstash processes and ships logs, and Kibana provides a web interface for visualization and analysis.

  • Splunk: A powerful log management and analysis platform that helps organizations gain insights from their machine data.

Resources

Network Monitoring

Network monitoring tools focus on monitoring network devices, traffic, and performance to identify and resolve network-related issues. These tools help in maintaining network health and optimizing performance.

Tools

  • Nagios: A widely-used open-source monitoring system that offers comprehensive monitoring and alerting capabilities for servers, network devices, and applications.

  • PRTG Network Monitor: A powerful network monitoring tool with auto-discovery features, customizable dashboards, and extensive alerting options.

Resources

  • Nagios Core Documentation: The quick start guide for Nagios Core, helping users get started with network monitoring.

  • PRTG Knowledge Base: The PRTG Knowledge Base with articles and guides on network monitoring best practices and troubleshooting.

Cloud Monitoring

Cloud monitoring tools are designed to monitor the performance, usage, and costs of cloud resources and services. These tools help in optimizing cloud infrastructure and ensuring efficient resource utilization.

Tools

  • Amazon CloudWatch: A monitoring service provided by AWS to monitor AWS resources and applications.

  • Google Cloud Monitoring: A monitoring service provided by Google Cloud Platform to monitor the health and performance of applications and infrastructure.

Resources

Kubernetes Monitoring

Monitoring tools specifically designed for Kubernetes to monitor the health and performance of Kubernetes clusters, workloads, and infrastructure.

Tools

  • Prometheus Operator: An operator for Kubernetes that simplifies the deployment and management of Prometheus instances. It allows users to define Prometheus configurations using custom resources.

  • kube-prometheus: A comprehensive collection of Kubernetes manifests, Grafana dashboards, and Prometheus rules that are bundled together to set up monitoring using Prometheus Operator.

  • Prometheus Adapter: An add-on that allows custom metrics to be exposed in Kubernetes APIs, enabling Horizontal Pod Autoscaling based on custom metrics.

  • kube-thanos: An extension to Prometheus that enables long-term storage and global querying capabilities using Thanos. It helps to address challenges related to Prometheus's short retention periods in Kubernetes.

Resources

Contributing

If you have a suggestion for a new monitoring tool or resource, or you want to contribute to the existing list, please read the contribution guidelines first. We welcome and appreciate your contributions!

License

This repository is open-source and available under the LICENSE terms. Please review the license before using or contributing to this repository.

Let's make monitoring awesome together!

About

This repository is a curated collection of valuable monitoring tools, resources, and best practices for developers, sysadmins, and DevOps professionals. It covers various aspects of monitoring, including infrastructure, applications, logs, networks, cloud, and Kubernetes.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published