Back
gh

netdata/netdata: The fastest path to AI-powered full stack observability, even for lean teams.

The fastest path to AI-powered full stack observability, even for lean teams. - netdata/netdata

by netdata github.com 1,939 words
View original

NetdataNetdata

X-Ray Vision for your infrastructure!

Every Metric, Every Second. No BS.

GitHub Stars
Live Demo Latest release Latest nightly build
Discourse topics GitHub Discussions
CII Best Practices Coverity Scan

User base Servers monitored Sessions served Docker Hub pulls

Visit our Home Page


MENU: WHO WE ARE | KEY FEATURES | GETTING STARTED | HOW IT WORKS | FAQ | DOCS | COMMUNITY | CONTRIBUTE | LICENSE

[!warning] Warning People get addicted to Netdata. Once you use it on your systems, there’s no going back.

Platforms


WHO WE ARE

Netdata is an open-source, real-time infrastructure monitoring platform. Monitor, detect, and act across your entire infrastructure.

Core Advantages:

With Netdata, you get real-time, per-second updates. Clear insights at a glance, no complexity.

All heroes have a great origin story. Click to discover ours.

In 2013, at the company where Costa Tsaousis was COO, a significant percentage of their cloud-based transactions failed silently, severely impacting business performance.

Costa and his team tried every troubleshooting tool available at the time. None could identify the root cause. As Costa later wrote:

I couldn’t believe that monitoring systems provide so few metrics and with such low resolution, scale so badly, and cost so much to run.

Frustrated, he decided to build his own monitoring tool, starting from scratch.

That decision led to countless late nights and weekends. It also sparked a fundamental shift in how infrastructure monitoring and troubleshooting are approached, both in method and in cost.

Most Energy-Efficient Monitoring Tool

Energy EfficiencyEnergy efficiency

According to the University of Amsterdam study, Netdata is the most energy-efficient tool for monitoring Docker-based systems. The study also shows Netdata excels in CPU usage, RAM usage, and execution time compared to other monitoring solutions.


Key Features

FeatureDescriptionWhat Makes It Unique
Real-TimePer-second data collection and processingWorks in a beat – click and see results instantly
Zero-ConfigurationAutomatic detection and discoveryAuto-discovers everything on the nodes it runs
ML-PoweredUnsupervised anomaly detectionTrains multiple ML models per metric at the edge
Long-Term RetentionHigh-performance storage~0.5 bytes per sample with tiered storage for archiving
Advanced VisualizationRich, interactive dashboardsSlice and dice data without query language
Extreme ScalabilityNative horizontal scalingParent-Child centralization with multi-million samples/s
Complete VisibilityFrom infrastructure to applicationsSimplifies operations and eliminates silos
Edge-BasedProcessing at your premisesDistributes code instead of centralizing data

[!note] Note Want to put Netdata to the test against Prometheus? Explore the full comparison.


Netdata Ecosystem

This three-part architecture enables you to scale from single nodes to complex multi-cloud environments:

ComponentDescriptionLicense
Netdata Agent• Core monitoring engine • Handles collection, storage, ML, alerts, exports • Runs on servers, cloud, K8s, IoT • Zero production impactGPL v3+
Netdata Cloud• Enterprise features • User management, RBAC, horizontal scaling • Centralized alerts • Free community tier • No metric storage centralization
Netdata UI• Dashboards and visualizations • Free to use • Included in standard packages • Latest version via CDNNCUL1

What You Can Monitor

With Netdata you can monitor all these components across platforms:

ComponentLinuxFreeBSDmacOSWindows
System Resources CPU, Memory and system shared resourcesFullYesYesYes
Storage Disks, Mount points, Filesystems, RAID arraysFullYesYesYes
Network Network Interfaces, Protocols, Firewall, etcFullYesYesYes
Hardware & Sensors Fans, Temperatures, Controllers, GPUs, etcFullSomeSomeSome
O/S Services Resources, Performance and StatusYes systemd---
Processes Resources, Performance, OOM, and moreYesYesYesYes
System and Application LogsYes systemd -journal--Yes Windows Event Log, ETW
Network Connections Live TCP and UDP sockets per PIDYes---
Containers Docker/containerd, LXC/LXD, Kubernetes, etcYes---
VMs (from the host) KVM, qemu, libvirt, Proxmox, etcYes cgroups--Yes Hyper-V
Synthetic Checks Test APIs, TCP ports, Ping, Certificates, etcYesYesYesYes
Packaged Applications nginx, apache, postgres, redis, mongodb, and hundreds moreYesYesYesYes
Cloud Provider Infrastructure AWS, GCP, Azure, and moreYesYesYesYes
Custom Applications OpenMetrics, StatsD and soon OpenTelemetryYesYesYesYes

On Linux, you can continuously monitor all kernel features and hardware sensors for errors, including Intel/AMD/Nvidia GPUs, PCI AER, RAM EDAC, IPMI, S.M.A.R.T, Intel RAPL, NVMe, fans, power supplies, and voltage readings.


Getting Started

You can install Netdata on all major operating systems. To begin:

1. Install Netdata

Choose your platform and follow the installation guide:

[!note] Note You can access the Netdata UI at http://localhost:19999 (or http://NODE:19999 if remote).

2. Configure Collectors

Netdata auto-discovers most metrics, but you can manually configure some collectors:

3. Configure Alerts

You can use hundreds of built-in alerts and integrate with:

email, Slack, Telegram, PagerDuty, Discord, Microsoft Teams, and more.

[!note] Note Email alerts work by default if there’s a configured MTA.

4. Configure Parents

You can centralize dashboards, alerts, and storage with Netdata Parents:

[!note] Note You can use Netdata Parents for central dashboards, longer retention, and alert configuration.

5. Connect to Netdata Cloud

Sign in to Netdata Cloud and connect your nodes for:

[!note] Note Netdata Cloud is optional. Your data stays in your infrastructure.

Live Demo Sites

See Netdata in action
FRANKFURT | NEWYORK | ATLANTA | SANFRANCISCO | TORONTO | SINGAPORE | BANGALORE
These demo clusters run with default configuration and show real monitoring data.
Choose the instance closest to you for the best performance.


How It Works

With Netdata you can run a modular pipeline for metrics collection, processing, and visualization.

flowchart TB
  A[Netdata Agent]:::mainNode
  A1(Collect):::green --> A
  A2(Store):::green --> A
  A3(Learn):::green --> A
  A4(Detect):::green --> A
  A5(Check):::green --> A
  A6(Stream):::green --> A
  A7(Archive):::green --> A
  A8(Query):::green --> A
  A9(Score):::green --> A

  classDef green fill:#bbf3bb,stroke:#333,stroke-width:1px,color:#000
  classDef mainNode fill:#f0f0f0,stroke:#333,stroke-width:1px,color:#333

With each Agent you can:

  1. Collect – Gather metrics from systems, containers, apps, logs, APIs, and synthetic checks.
  2. Store – Save metrics to a high-efficiency, tiered time-series database.
  3. Learn – Train ML models per metric using recent behavior.
  4. Detect – Identify anomalies using trained ML models.
  5. Check – Evaluate metrics against pre-set or custom alert rules.
  6. Stream – Send metrics to Netdata Parents in real time.
  7. Archive – Export metrics to Prometheus, InfluxDB, OpenTSDB, Graphite, and others.
  8. Query – Access metrics via an API for dashboards or third-party tools.
  9. Score – Use a scoring engine to find patterns and correlations across metrics.

[!note] Note Learn more: Netdata’s architecture

Agent Capabilities

With the Netdata Agent, you can use these core capabilities out-of-the-box:

CapabilityDescription
Comprehensive Collection• 800+ integrations • Systems, containers, VMs, hardware sensors • OpenMetrics, StatsD, and logs • OpenTelemetry support coming soon
Performance & Precision• Per-second collection • Real-time visualization with 1-second latency • High-resolution metrics
Edge-Based ML• ML models trained at the edge • Automatic anomaly detection per metric • Pattern recognition based on historical behavior
Advanced Log Management• Direct systemd-journald and Windows Event Log integration • Process logs at the edge • Rich log visualization
Observability Pipeline• Parent-Child relationships • Flexible centralization • Multi-level replication and retention
Automated Visualization• NIDL data model • Auto-generated dashboards • No query language needed
Smart Alerting• Pre-configured alerts • Multiple notification methods • Proactive detection
Low Maintenance• Auto-detection • Zero-touch ML • Easy scalability • CI/CD friendly
Open & Extensible• Modular architecture • Easy to customize • Integrates with existing tools

CNCF Membership

CNCF Logo
Netdata actively supports and is a member of the Cloud Native Computing Foundation (CNCF).
It is one of the most starred projects in the CNCF landscape.


FAQ

Is Netdata secure?

Yes. Netdata follows OpenSSF best practices, has a security-first design, and is regularly audited by the community.

No. Even with ML and per-second metrics, Netdata uses minimal resources.

You can use the Netdata Monitoring section in the dashboard to inspect its resource usage.

How much data retention is possible?

As much as your disk allows.

With Netdata you can use tiered retention:

These are queried automatically based on the zoom level.

Can Netdata scale to many servers?

Yes. With Netdata you can:

You can use Netdata Cloud to merge many independent infrastructures into one logical view.

Is disk I/O a concern?

No. Netdata minimizes disk usage:

You can use alloc or ram mode for no disk writes.

How is Netdata different from Prometheus + Grafana?

With Netdata you get a complete monitoring solution—not just tools.

Performance comparison

How is Netdata different from commercial SaaS tools?

With Netdata you can store all metrics on your infrastructure—no sampling, no aggregation, no loss.

Yes. You can use Netdata together with traditional tools.

With Netdata you get:

You can start small:

Docs and guides

Do I have to use Netdata Cloud?

No. Netdata Cloud is optional.

Netdata works without it, but with Cloud you can:

Anonymous telemetry helps improve the product. You can disable it:

Telemetry helps us understand usage, not track users. No private data is collected.

Who uses Netdata?

You’ll join users including:


📖 Documentation

Visit Netdata Learn for full documentation and guides.

[!note] Note Includes deployment, configuration, alerting, exporting, troubleshooting, and more.


🎉 Community

Join the Netdata community:

[!note] Note Code of Conduct

Follow us on: Twitter | Reddit | YouTube | LinkedIn


🙏 Contribute

We welcome your contributions.

Ways you help us stay sharp:

[!note] Note Contribution guide


📜 License

The Netdata ecosystem includes: