I’m hoping to find something that:

  • has a nice dashboard
  • is quick and simple to install
  • is very lightweight and unobtrusive
  • can send alerts via HTTP request

Edit: Thanks everyone, love this community! I went with Beszel, lots of other good recommendations too

  • @[email protected]
    2
    5 months ago

    I use LibreNMS because it relies on SNMP for monitoring (which is available pretty much everywhere). I don’t believe it has HTTP alerts, but I know for a fact that it can send Telegram messages.

  • @[email protected]
    11
    5 months ago

    I personally use CheckMK.

    • Offers a free “Raw” version.
    • Can be deployed with docker.
    • OSS

    One caveat: it can be a lot to take in at first, and it took me a while to get used to it.

    • @[email protected]
      1
      5 months ago

      checkmk user here. i can second the adjustment phase. i tend to ignore my servers but when something goes sideways it’s awesome to have checkmk’s structure in place.

    • @[email protected]
      2
      5 months ago

      CheckMk user here via omd.

      I’m looking for something else after the upgrade.

      1. Black interface isn’t pretty for me and the old interface was “meh too hard so we ditched it”.

      2. One half of the project split has a shit supply chain and just doesn’t meet the bar for upgrade requirements.

      3. The other half of the project split is a mess to config in an automated desired-state setup. It’s all edge-triggered manual bullshit. NO. ENOUGH.

      I miss 1.2.

  • @[email protected]
    5
    5 months ago

    Nagios. It does depend on what you mean by “monitor”, though. Nagios is good at telling you that “service A on host B is down”, but less useful for looking at things like performance trends. I particularly like being able to set up dependencies between services, so I get the alert for the root cause and not for every service that has gone down because of it.
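
To illustrate the root-cause idea, here is a minimal Python sketch of dependency-based alert suppression. It is not how Nagios implements it (Nagios takes dependencies from its own object configuration); the function name `root_cause_alerts` and the example services are made up for illustration, and the dependency graph is assumed to have no cycles.

```python
def root_cause_alerts(down, depends_on):
    """Return the failed services that have no failed ancestor.

    down: set of names of services that are currently failing.
    depends_on: dict mapping a service to the services it depends on.
    (Both structures are hypothetical; this is not a Nagios API.)
    """
    def has_failed_ancestor(service, seen):
        # Walk up the dependency chain, including indirect dependencies.
        for parent in depends_on.get(service, ()):
            if parent in seen:
                continue
            seen.add(parent)
            if parent in down or has_failed_ancestor(parent, seen):
                return True
        return False

    return sorted(s for s in down if not has_failed_ancestor(s, {s}))


# Hypothetical topology: web depends on db, db depends on the switch.
dependencies = {"web": ["db"], "db": ["switch"]}
print(root_cause_alerts({"switch", "db", "web"}, dependencies))  # ['switch']
```

With the switch down, only the switch is reported; the db and web failures are treated as consequences and suppressed.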

  • @[email protected]
    10
    5 months ago
    • Base ansible role installs Prometheus node exporter, configured with the text file collector
    • VM automations push DNS records so that the Prometheus dns-sd automatically discovers them
    • Ansible roles add Cron jobs that generate metrics for specific systems and dump them for the text file collector
    • Grafana for dashboards
    • Karma as a UI in front of Prometheus alert manager
    • @[email protected]
      1
      5 months ago

      Cron jobs that generate metrics for specific systems and dump them for the text file collector

      Details please

      • @[email protected]
        2
        5 months ago
        • https://github.com/prometheus/node_exporter?tab=readme-ov-file#textfile-collector - which makes node exporter watch a specific directory for files that contain metrics, then re-export them back to the central Prometheus server
        • Some systems have their own metrics endpoints - instead of getting Prometheus to scrape these directly I set up a Cron job to curl these into files for node exporter - this means I don’t need extra config in Prometheus to find the endpoints, and don’t need to mess with firewall rules
        • Other systems don’t directly expose metrics in a format Prometheus can use - in this case I will write/find a script that can do the conversion, then either set it up to write the metrics file directly and run it on a Cron, or run it as a service and add another Cron job to do the scrape
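
A script of the "write the metrics file directly" kind could look roughly like the sketch below. It is an illustration rather than the poster's actual script: the helper names, the metric name, and the temporary directory standing in for the textfile collector directory are all assumptions. It relies on two documented behaviours of the node exporter textfile collector: only `*.prom` files are read, and files should be written atomically (temp file plus rename) so a half-written file is never scraped.

```python
import os
import tempfile


def format_metric(name, value, labels=None, help_text=None, metric_type="gauge"):
    """Render one sample in the Prometheus text exposition format."""
    lines = []
    if help_text:
        lines.append(f"# HELP {name} {help_text}")
    lines.append(f"# TYPE {name} {metric_type}")
    label_str = ""
    if labels:
        def escape(text):
            # Label values must escape backslash, double quote and newline.
            return str(text).replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
        pairs = ",".join(f'{key}="{escape(val)}"' for key, val in sorted(labels.items()))
        label_str = "{" + pairs + "}"
    lines.append(f"{name}{label_str} {value}")
    return "\n".join(lines) + "\n"


def write_prom_file(directory, filename, content):
    """Write content to directory/filename atomically via a temp file and rename."""
    if not filename.endswith(".prom"):
        raise ValueError("the textfile collector only reads *.prom files")
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as handle:
            handle.write(content)
        os.chmod(tmp_path, 0o644)  # mkstemp creates the file as 0600
        os.replace(tmp_path, os.path.join(directory, filename))
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise


if __name__ == "__main__":
    # Stand-in for the directory given to --collector.textfile.directory.
    target_dir = tempfile.mkdtemp()
    text = format_metric(
        "backup_last_success_timestamp_seconds",  # hypothetical metric name
        1700000000,
        labels={"job": "nightly"},
        help_text="Unix time of the last successful backup",
    )
    write_prom_file(target_dir, "backup.prom", text)
    print(os.path.join(target_dir, "backup.prom"))
```

Run from Cron, a script like this would replace the `.prom` file on each run, and node exporter would pick up the new values on the next scrape.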
    • @[email protected]
      1
      5 months ago

      Any chance you’d be willing to share playbooks or point me toward any resources you used?

      I use Ansible to manage config across all my workstations/servers but I haven’t gotten around to automating log shipping yet or aggregating system metrics.

  • tath
    8
    5 months ago

    Zabbix is pretty quick and easy. It has many notification services built in, and you can add your own custom ones (including webhooks). The dashboard is fully customizable as well, so you can add whatever you want/need at a glance.

    • 8adger
      1
      5 months ago

      I stopped by to say the same thing. I use Zabbix to monitor everything.

  • hindy
    2
    5 months ago

    Hello,

    I’m still using Nagios here. For service availability I’m using uptime-kuma (in a Docker container).

  • Phoenixz
    8
    5 months ago

    We just recently started using Zabbix. It’s open source and has a web interface that gives a central view and can be accessed from wherever we allow it.

    So far it’s been great, but we have had little time and so far have used only 1% of what it can do.

    Still, I’d recommend it. Super easy to install, seems lightweight, has clients for any OS you’d need, and can send out alerts (we currently use Pushover for that).