Easy metrics dashboard for multiple systems?

Dataprolet@lemmy.dbzer0.com to Selfhosted@lemmy.world – 35 points –

I'm looking for a simply solution to monitor all my servers and systems using a single dashboard. I want to see metrics like CPU usage, used RAM and storage to see if something is wrong.
I just set up Node-Exporter, Prometheus and Grafana but haven't found an existing dashboard that shows multiple hosts at once. Now I looked into Checkmk and Zabbix but I feel like both are a little overpowered for what I'm looking for. Do you have any recommendations?

18

Question: why don't you crate your own Grafana dashboard? Do you have too many servers?

I don't want to configure a whole Dashboard for at least CPU, RAM, Storage and Network for up to 5 hosts.

You’re likely not going to find a premade dashboard that does exactly what you want, but grafana is extremely powerful if you’re willing to put in the time to learn it. There are ways to visualize things across hosts without having to configure things separately for every host. If you’re using the same mechanism to scrape metrics from each (sounds like you’re using prometheus + node exporter?), this could be as simple as adding a by (node) (or whatever the label name is if it’s not node) grouping to the query on each panel.

Exactly this. We use node exporter, Prometheus, and Grafana at my place of work to get node metrics across our K8s cluster for CPU, memory utilization, file system space, etc.

You’ll have to do some searching and tweaking of existing dashboards, but Grafana is crazy good

It's not that bad. Grafana has a lot of features for this.

I think you can use grafana to present vidgets from different dashboards in one.

Btop running in multiple ssh sessions lol

Edit: lol you said "easy"

This is actually how I do things when working on remote machines. I have far too many monitors, so dedicating on of them to a handful of btop/nvtop terminals works pretty well.

I admit that it's a less than perfect setup though, and a single program which could handle the remote connections internally and display an aggregate would be nice.