
Running Uptime-Kuma on Kubernetes #4530

Closed
wachtell opened this issue Feb 26, 2024 · 2 comments
Labels
area:core (issues describing changes to the core of uptime kuma), help

Comments


wachtell commented Feb 26, 2024

⚠️ Please verify that this question has NOT been raised before.

  • I checked and didn't find a similar issue

🛡️ Security Policy

📝 Describe your problem

I am sorry if this has been reported before. I am running Uptime Kuma on a Kubernetes cluster with 3 server nodes and 8 agent nodes on Ubuntu 22.04, with storage on Longhorn persistent volumes. I am getting a lot of "timeout of 48000ms exceeded", "getaddrinfo ENOTFOUND", and "Request failed with status code 520" errors even though the monitored sites are up. Changing the storage to a bind-mount on the node helps somewhat, which is a further indication that this is a Kubernetes/storage issue.

Can you help me figure out what I am doing wrong?
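For reference, the bind-mount workaround I mentioned looks roughly like this (a minimal sketch, not my exact manifest; the node name, host path, and image tag are illustrative):

```yaml
# Minimal sketch of the bind-mount workaround: a hostPath volume keeps
# /app/data on the node's local disk instead of on a Longhorn volume.
# Node name, host path, and image tag are illustrative.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: uptime-kuma
spec:
  replicas: 1
  selector:
    matchLabels:
      app: uptime-kuma
  template:
    metadata:
      labels:
        app: uptime-kuma
    spec:
      nodeName: agent-1               # pin the pod so the data stays on one node
      containers:
        - name: uptime-kuma
          image: louislam/uptime-kuma:1.23.11
          ports:
            - containerPort: 3001
          volumeMounts:
            - name: data
              mountPath: /app/data
      volumes:
        - name: data
          hostPath:
            path: /srv/uptime-kuma    # local directory on the node
            type: DirectoryOrCreate
```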

📝 Error Message(s) or Log

timeout of 48000ms exceeded
getaddrinfo ENOTFOUND
Request failed with status code 520

🐻 Uptime-Kuma Version

1.23.11

💻 Operating System and Arch

Ubuntu 22.04

🌐 Browser

Firefox 123.0

🖥️ Deployment Environment

Kubernetes Version: v1.27.10 +rke2r1

@wachtell wachtell added the help label Feb 26, 2024
@CommanderStorm CommanderStorm added the area:core issues describing changes to the core of uptime kuma label Feb 26, 2024
CommanderStorm (Collaborator) commented

> Can you help me figure out what I am doing wrong?

Longhorn uses iSCSI/NFS under the hood, as I understand it.
=> uptime-kuma contains a database
=> you are running a database on a network share
=> possibly the added latency of reads/writes is killing the database performance, and not #3515

Note that running on an NFS-style filesystem has soundness issues with SQLite databases due to faulty file locking, which may lead to a corrupted database.
Please run uptime-kuma on a local volume instead.
See https://github.com/louislam/uptime-kuma/wiki/%F0%9F%94%A7-How-to-Install#-docker and https://www.sqlite.org/howtocorrupt.html#_filesystems_with_broken_or_missing_lock_implementations
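If you want to keep using PVCs rather than a raw hostPath, a "local" PersistentVolume achieves the same thing: the data lives on one node's own disk, so SQLite gets working file locking. A minimal sketch (the storage class name, node name, capacity, and path below are assumptions):

```yaml
# Sketch: a local PersistentVolume pins the data to one node's disk,
# avoiding the broken file locking of network filesystems.
apiVersion: v1
kind: PersistentVolume
metadata:
  name: uptime-kuma-local
spec:
  capacity:
    storage: 5Gi
  accessModes: ["ReadWriteOnce"]
  persistentVolumeReclaimPolicy: Retain
  storageClassName: local-storage
  local:
    path: /srv/uptime-kuma            # must already exist on the node
  nodeAffinity:                       # required for local volumes
    required:
      nodeSelectorTerms:
        - matchExpressions:
            - key: kubernetes.io/hostname
              operator: In
              values: ["agent-1"]
---
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: uptime-kuma-data
spec:
  accessModes: ["ReadWriteOnce"]
  storageClassName: local-storage
  resources:
    requests:
      storage: 5Gi
```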

> I am running Uptime Kuma on a Kubernetes cluster with 3 server nodes

HA will not work with uptime-kuma. Please don't run multiple instances of the same docker container as this may corrupt the database.
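To enforce a single instance in Kubernetes, run it as a StatefulSet with replicas: 1 (or a Deployment with strategy: Recreate), so an old and a new pod never mount the data volume at the same time. A sketch, reusing the assumed local-storage class from above:

```yaml
# Sketch: exactly one instance. A StatefulSet with replicas: 1 replaces
# the old pod before starting a new one, so two processes never open
# the SQLite database concurrently.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: uptime-kuma
spec:
  serviceName: uptime-kuma
  replicas: 1                          # uptime-kuma is not HA-safe
  selector:
    matchLabels:
      app: uptime-kuma
  template:
    metadata:
      labels:
        app: uptime-kuma
    spec:
      containers:
        - name: uptime-kuma
          image: louislam/uptime-kuma:1.23.11
          ports:
            - containerPort: 3001
          volumeMounts:
            - name: data
              mountPath: /app/data
  volumeClaimTemplates:
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]   # RWO keeps a second node from mounting it
        storageClassName: local-storage  # assumption, see the sketch above
        resources:
          requests:
            storage: 5Gi
```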

V2 includes the option to connect to external databases (or to continue with the embedded MariaDB/SQLite).
See #4500
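Once v2 is available, the external database connection could be wired up via environment variables from a Secret. Caution: the UPTIME_KUMA_DB_* variable names below are an assumption on my part; verify them against the v2 documentation / #4500 before relying on them:

```yaml
# Sketch only: points uptime-kuma v2 at an external MariaDB.
# The UPTIME_KUMA_DB_* names are assumptions -- verify against the v2 docs.
apiVersion: v1
kind: Secret
metadata:
  name: uptime-kuma-db
stringData:
  UPTIME_KUMA_DB_TYPE: mariadb
  UPTIME_KUMA_DB_HOSTNAME: mariadb.db.svc.cluster.local
  UPTIME_KUMA_DB_PORT: "3306"
  UPTIME_KUMA_DB_NAME: uptime_kuma
  UPTIME_KUMA_DB_USERNAME: uptime_kuma
  UPTIME_KUMA_DB_PASSWORD: change-me
```

The container would then pick these up via envFrom with a secretRef to uptime-kuma-db.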

In the meantime, choose a lower retention to mask this issue.

wachtell (Author) commented

@CommanderStorm Thank you for your fast and insightful comments! They are really helpful.
